NVIDIA
Explore Models Blueprints GPUs
Terms of Use

|

Privacy Policy

|

Manage My Privacy

|

Contact

Copyright Ā© 2025 NVIDIA Corporation

ModelsExplore Models
BlueprintsGet Started with Blueprints
GPUsLaunch a GPU Instance

Deploy Models Now with NVIDIA NIM

Optimized inference for the world’s leading models
Free serverless APIs for developmentAccelerated by DGX Cloud
Self-Host on your GPU infrastructure
Continuous vulnerability fixes

Most Popular Models

View All

The leading open models built by the community, optimized and accelerated by NVIDIA's enterprise-ready inference runtime.

PREVIEW

metallama-4-maverick-17b-128e-instruct

A general purpose multimodal, multilingual 128 MoE model with 17B parameters.

language generationimage-to-textvision assistantvisual question answering
PREVIEW

metallama-4-scout-17b-16e-instruct

A multimodal, multilingual 16 MoE model with 17B parameters.

language generationimage-to-textvision assistantvisual question answering
Run Anywhere

nvidiallama-3.1-nemotron-ultra-253b-v1

Superior inference efficiency with highest accuracy for scientific and complex math reasoning, coding, tool calling, and instruction following.

advanced reasoningfunction callinginstruction followingmath
Run Anywhere

nvidiallama-3.3-nemotron-super-49b-v1

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

advanced reasoningfunction callinginstruction followingmath
Run Anywhere

deepseek-aideepseek-r1

State-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding.

mathadvanced reasoningchat
PREVIEW

nvidiallama-3.1-nemotron-nano-8b-v1

Leading reasoning and agentic AI accuracy model for PC and edge.

advanced reasoningfunction callinginstruction followingmath
PREVIEW

googlegemma-3-27b-it

Cutting-edge open multimodal model exceling in high-quality reasoning from images.

language generationvision assistantvisual question answeringimage-to-text
PREVIEW

microsoftphi-4-multimodal-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.

chart and table understandinglanguage generationspeech recognitionvisual qaimage-to-text
Run Anywhere

nvidiacosmos-predict1-7b

Generates physics-aware video world states from text and image prompts for physical AI development.

physical aiimage-to-worldroboticstext-to-worldsynthetic data generation

Create AI Agents

View All

Blueprints to build and deploy Agentic AI applications, digital twins, etc.

nvidiaBuild an AI Agent for Research and Reporting

Create AI agents that reason, plan, reflect and refine to produce high-quality reports based on source materials of your choice.

blueprintllama nemotronnimnemo retrieverreasoningretrieval-augmented generationnvidia ai

nvidiaLLM Router

Route LLM requests to the best model for the task at hand.

blueprintllm routernvidia ai

nvidiaPDF to Podcast

Transform PDFs into AI podcasts for engaging on-the-go audio content.

conversational aimulti-modalpdf-to-podcasttext-to-speechblueprintnvidia aiai agenttext-to-speech

nvidiaBuild an Enterprise RAG pipeline

Connect AI applications to multimodal enterprise data with a scalable retrieval augmented generation (RAG) pipeline built on highly performant, industry-leading NIM microservices, for faster PDF data extraction and more accurate information retrieval.

blueprintnimnemo retrieverretrieval-augmented generationnvidia ai

crewaiCode Documentation for Software Development

Document your github repositories with AI Agents using CrewAI and Llama3.3 70B NIM.

ai agentsblueprintcode documentationcrewaipartnernvidia ai

langchainStructured Report Generation

Generate detailed, structured reports on any topic using LangGraph and Llama3.3 70B NIM

ai agentsblueprintlanggraphpartnerreport generationnvidia ai

llamaindexDocument Research Assistant for Blog Creation

Automate research, and generate blogs with AI Agents using LlamaIndex and Llama3.3-70B NIM LLM.

ai agentsblog creationblueprintllamaindexpartnernvidia ai

pipecatVoice Agent Framework for Conversational AI

Automate voice AI agents with NVIDIA NIM microservices and Pipecat.

ai agentsblueprintconversational aipartnerpipecatnvidia ai

wandbTraceability for Agentic AI

Trace and evaluate AI Agents with Weights & Biases.

ai agentsblueprintpartnertraceabilitywandbnvidia ai
DiscoverModelsBlueprintsGPUs
Docs
Forums
models
ReasoningVisionVisual DesignRetrievalSpeechBiologySimulationClimate & WeatherSafety & Moderation
industries
AutomotiveGamingHealthcareIndustrialRobotics

Discover

AI Models for RTX AI PCs and Workstations

View All

Spanning language, speech, animation, content generation, and vision capabilities, run NVIDIA NIM microservices on your RTX AI PC.

Run Anywhere

deepseek-aideepseek-r1-distill-llama-8b

Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.

distillationcodingmathreasoningrun on rtx
Run Anywhere

black-forest-labsFLUX.1-dev

FLUX.1 is a state-of-the-art suite of image generation models

run on rtximage generationtext-to-image
Run Anywhere

nv-mistralaimistral-nemo-12b-instruct

Most advanced language model for reasoning, code, multilingual tasks; runs on a single GPU.

chatcode generationlanguage generationtext-to-textrun on rtxcode generation
Run Anywhere

metallama-3.1-8b-instruct

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

chatlanguage generationrun on rtxtext-to-textcode generation
Run Anywhere

nvidiaparakeet-ctc-0.6b-asr

State-of-the-art accuracy and speed for English transcriptions.

asrbatchenglishfastnvidia nimrun on rtxstreamingspeech-to-text
Run Anywhere

nvidiastudiovoice

Enhance speech by correcting common audio degradations to create studio quality speech output.

digital humannvidia maxinerun on rtxspeech enhancementspeech-to-speech
Run Anywhere

nvidiallama-3.2-nv-embedqa-1b-v2

Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.

embeddingnemo retrieverrun on rtxretrieval augmented generationtext-to-embedding
Run Anywhere

nvidianvclip

NV-CLIP is a multimodal embeddings model for image and text.

computer visionnvidia nimrun on rtxmultimodal embeddingstext and image
Run Anywhere

baidupaddleocr

Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.

optical character detectionoptical character recognitiontable extractiondata ingestionextractionnemo retrieverrun on rtx
Run Anywhere

nvidianv-yolox-page-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

chart detectiondata ingestionobject detectiontable detectionextractionnemo retrieverrun on rtx

Develop Physical AI

View All

Pre-trained foundation models and blueprints for digital twins, synthetic data generation, and robotic simulation to accelerate Physical AI development.

nvidiaTest Multi-Robot Fleets for Industrial Automation

Simulate, test, and optimize physical AI and robotic fleets at scale in industrial digital twins before real-world deployment.

blueprintnvidia omniverseindustrialomniverse blueprintsimulation

nvidiaSynthetic Manipulation Motion Generation for Robotics

Generate exponentially large amounts of synthetic motion trajectories for robot manipulation from just a few human demonstrations.

blueprinthumanoidsnvidia isaac gr00tnvidia omniverseimage-to-worldphysical airobot learningroboticssynthetic datateleoptext-to-world

nvidiacosmos-predict1-7b

Generates physics-aware video world states from text and image prompts for physical AI development.

physical aiimage-to-worldroboticstext-to-worldsynthetic data generation

nvidiacosmos-predict1-5b

Generates future frames of a physics-aware world state based on simply an image or short video prompt for physical AI development.

physical aipolicy evaluationroboticssynthetic data generationvideo-to-world

nvidiaBuild a Digital Twin for Interactive Fluid Simulation

This NVIDIA Omniverseā„¢ Blueprint demonstrates how commercial software vendors can create interactive digital twins.

blueprintcaecomputer-aided-engineeringexternal aerodynamicsnvidia omniversesimulation

Accelerate Your Simulation Workflows

View All

Blueprints to help you expedite simulation and development with NVIDIA Omniverse.

nvidiaAI Weather Analytics with Earth-2

Develop AI powered weather analysis and forecasting application visualizing multi-layered geospatial data.

ai weather predictionblueprintclimate scienceearth-2nvidia aiweather simulation

nvidiaBuild a Digital Twin for Interactive Fluid Simulation

This NVIDIA Omniverseā„¢ Blueprint demonstrates how commercial software vendors can create interactive digital twins.

blueprintcaecomputer-aided-engineeringexternal aerodynamicsnvidia omniversesimulation

nvidiaBuild a Digital Human

Create intelligent, interactive avatars for customer service across industries

audio-to-faceblueprintchatdigital humansspeech-to-textnvidia ainvidia omniverse

nvidia3D Conditioning for Precise Visual Generative AI

Enhance and modify high-quality compositions using real-time rendering and generative AI output without affecting a hero product asset.

blueprintnvidia omniversesimulationvisual design

nvidiaSynthetic Manipulation Motion Generation for Robotics

Generate exponentially large amounts of synthetic motion trajectories for robot manipulation from just a few human demonstrations.

blueprinthumanoidsnvidia isaac gr00tnvidia omniverseimage-to-worldphysical airobot learningroboticssynthetic datateleoptext-to-world

nvidiaTest Multi-Robot Fleets for Industrial Automation

Simulate, test, and optimize physical AI and robotic fleets at scale in industrial digital twins before real-world deployment.

blueprintnvidia omniverseindustrialomniverse blueprintsimulation

Accelerated Computing for Digital Biology

View All

AI-driven drug discovery and accelerated genomics workflows.

nvidiaGenomics Analysis

Easily run essential genomics workflows to save time leveraging Parabricks

biologyblueprintgenomicsparabricksnvidia aidna sequencing

nvidiaSingle Cell Analysis

Investigate, understand, and interpret single cell data in minutes, not days by leveraging RAPIDS-singlecell, powered by NVIDIA RAPIDS

biologyblueprintgenomicsrapidssingle cellnvidia airna sequencing

nvidiaBuild A Generative Protein Binder Design Pipeline

This blueprint shows how generative AI and accelerated NIM microservices can design protein binders smarter and faster.

bionemobiologyblueprintprotein generationnvidia bionemodrug discovery

nvidiaBuild A Generative Virtual Screening Pipeline

This blueprint shows how generative AI and accelerated NIM microservices can design optimized small molecules smarter and faster.

bionemoblueprintchemistrydockingnimnvidia bionemodrug discovery